SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLIOJ^T: Sullivan, Francis 
Kriz, Ron 
Kumar, Ravindra 

(ii) TITLE OF I>JVENTICN: Human GDP-Mannose Hydratase 

(iii) NL-MBER OF SEQUENCES: 5 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Genecics Institute, Inc. 

(B) STREET: 87 CaiubridgePark Drive 

(C) CITY: Cambridge 

(D) STATE: Massachusetts 

(E) COUNTRY: USA 

(F) ZIP: 02140 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patent In Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NL'MBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFCRyJVTION : 

(A) NAME: Brown, Sccct A. 

(B) REGISTRATION NU>EER: 32,724 

(C) REFERENCE/DOCKET NL"MBER: GI5285 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (617) 498-8224 

(B) TELEFAX: (617) 875-5851 



(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1521 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 
GCCGCGTTCC TGCCGGCACC GCGCCTGCCC TCTGCCGCGC TCCGCCCTGC CGCCGACCGC 
ACGCCCGCCG CGGGACATGG CACACGCACC GGCACGCTGC CCCAGCGCCC GGGGCTCCGG 
GGACGGCGAG ATGGGCAAGC CCAGGAACGT GGCGCTCATC ACCGGTATCA CAGGCCAGGA 
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TGGTTCCTAC 


CTGGCTGAGT TCCTGCTGGA 


GAAAGGCTAT 


GAGGTCCATG 


GAATTGTACG 


240 


GCGGTCCAGT 


TCATTTAATA CGGGTCGAAT 


TGAGCATCTG 


TATAAGAATC 


CCCAGGCTCA 


300 


CATTGAAGGA 


AACATGAAGT TGCACTATGG 


CGATCTCACT 


GACAGTACCT 


GCCTTGTGAA 


360 


GATCATTAAT 


GAAGTAAAGC CCACAGAGAT 


CTACAACCTT 


GGAGCCCAGA 


GCCACGTCAA 


420 


AATTTCCTTT 


GACCTCGCTG AGTACACTGC 


GGACGTTGAC 


GGAGTTGGCA 


CTCTACGACT 


480 


T CTAG ATGCA 


GTTAAGACTT GTGGCCTTAT 


CAACTCTGTG 


AAGTTCTACC 


AAGCCTCAAC 


540 


AAGTGAACTT 


TATGGGAAAG TGCAGGAAAT 


ACCCCAGAAG 


GAGACCACCC 


CTTTCTATCC 


600 


CCGGTCACCC 


TATGGGGCAG C^AAACTCTA 


TGCCTATTGG 


ATTGTGGTGA 


ACTTCCGTGA 


660 


GGCGTATAAT 


CTCTTTGCAG TGAACGGCAT 


TCTCTTCAAT 


CATGAGAGTC 


CCAGAAGAGG 


720 


AGCTAATTTC GTTACTCGAA AAATTAGCCG 


GTCAGTAGCT 


AAGATTTACC 


TTGGACAACT 


780 


GGAATGTTTC 


AGTTTGGGAA ATCTGGATGC 


CAAACGAGAT 


TGGGGCCATG 


CCAAGGACTA 


840 


TGTGGAGGCT 


ATGTGGTTGA TGTTGCAGAA 


TGATGAGCCG 


GAGGACTTCG 


TTATAGCTAC 


900 


TGGGGAGGTC 


CATAGTGTCC GGGAATTTGT 


CGAGAAATCA 


TTCTTGCACA 


TTGGAAAAAC 


960 


CATTGTGTGG GAAGGAAAGA ATGAAAATGA AGTGGGCAGA 


TGTAAAGAGA 


CCGGCAAAGT 


1020 


TCACGTGACT GTGGATCTCA AGTACTACCG 


GCCAACTGAA 


.GTGGACTTTC 


TGCAGGGCGA 


1080 


CTGCACCAAA GCGAAACAGA AGCTGAACTG 


GAAGCCCCGG 


GTCGCTTTCG 


ATGAGCTGGT 


1140 


GAGGGAGATG 


GTGCACGCCG ACGTGGAGCT 


CATGAGGACA 


AACCCCAATG 


CCTGAGCAGC 


1200 


GCCTCGGAGC 


CCGGCCCGCC CTCCGGCTAC 


AATCCCCGCA 


GAGTCTCCGG 


TGCAGACGCG 


1260 


CTGCGGGGAT 


GGGGAGCGGC GTGCCAATCT 


GCGGGTCCCC 


TGCGGCCCCT 


GCTGCCGCTG 


1320 


CGCTGTCCCG 


GCCGCAAGAG CGGGGCCGCC 


CCGCCGAGGT 


TTGTAGCAGC 


CGGGATGTGA 


1380 


CCCTCCAGGG 


TTTGGGTCGC TTTGCGTTTG 


TCGAAGCCTC 


CTCTGAATGG 


CTTTGTGAAA 


1440 


TCAAGATGTT 


TTAATCACAT TCACTTTACT 


TGAAATTATG 


TTGTTACACA ACAAATTGTG 


1500 


GGGCCTTCAA ATTGTTTTTC C 
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(2) INFORMATION FOR SEQ ID NO: 2: 











(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 354 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Gly Lys Pro Arg Asn Val Ala Leu He Thr Gly He Thr Gly Gin 
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Asp Gly Ser Tyr Leu 
20 

His Gly He Val Arg 
35 

His Leu Tyr Lys Asn 
50 

His Tyr Gly Asp Leu 
65 

Glu Val Lys Pro Thr 
85 

Lys He Ser Phe Asp 
100 



Ala Glu Phe Leu Leu Glu 
25 

Arg Ser Ser Ser Phe Asn 
40 

Pro Gl.-i Ala His He Glu 



Thr Asp Ser Thr Cys Leu 
70 75 

Glu lie Tyr Asn Leu Gly 
90 

Leu Ala Glu Tyr Thr Ala 
105 



Lys Gly Tyr Glu Val 
30 

Thr Gly Arg He Glu 
45 

Gly Asn Met Lys Leu 
60 

Val Lys He He Asn 
80 

Ala Gin Ser His Val 
95 

Asp Val Asp Gly Val 
110 



Gly Thr Leu Arg 
115 

Ser Val Lys Phe 
130 



Leu Leu Asp Ala 
120 

Tyr Gin Ala Ser 



Val Lys Thr Cys 

Thr Ser Glu Leu 
140 



Gly Leu He Asn 
125 

Tyr Gly Lys Val 



Gin Glu He Pro Gin Lys Glu Thr 
145 150 

Tyr Gly Ala Ala Lys Leu Tyr Ala 
165 

Glu Ala Tyr Asn Leu Phe Ala Val 
180 



Thr Pro Phe Tyr Pro Arg Ser Pro 
155 160 

Tyr Trp He Val Val Asn Phe Arg 
170 175 

Asn Gly He Leu Phe Asn His Glu 
185 190 



Ser Pro Arg Arg Gly Ala As" Phe 
195 • 200 

Val Ala Lys He Tyr Leu Gly Gin 
210 215 

Leu Asp Ala Lys Arg Asp Trp Gly 
225 230 

Met Trp Leu Met Leu Gin Asn Asp 
245 

Thr Gly Glu Val His Ser Val Arg 
260 

His He Gly Lys Thr He Val Trp 
275 230 



Val Thr Arg Lys He Ser Arg Ser 
205 

Leu Glu Cys Phe Ser Leu Gly Asn 
220 

His Ala Lys Asp Tyr Val Glu Ala 
235 240 

Glu Pro Glu Asp Phe Val He Ala 
250 25S 

Glu Phe Val Glu Lys Ser Phe Leu 
265 270 

Glu Gly Lys Asn Glu Asn Glu Val 
285 



Gly Arg Cys Lys Glu Thr Gly Lys Val His Val Thr Val Asp Leu Lys 
290 295 300 

Tyr Tyr Arg Pro Thr Glu Val Asp Phe Leu Gin Gly Asp Cys Thr Lys 
305 310 315 320 

Ala Lys Gin Lys Leu Asn Trp Lys Pro Arg Val Ala Phe Asp Glu Leu 
325 330 335 
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Val Arg Glu Met Val His Ala Asp Val Glu Leu Met Arg Thr Asn Pro 
340 345 350 



Asn Ala 



INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 372 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIP 

Met Ala His Ala Pro 
1 5 

Gly Glu Met Gly Lys 
20 

Gly Gin Asp Gly Ser 
35 

Glu Val His Gly He 
50 

He Glu His Leu Tyr 
65 

Lys Leu His Tyr Gly 
85 

He Asn Glu Val Lys 
100 

His Val Lys He Ser 
115 

Gly Val Gly Thr Leu 
130 

He Asn Ser Val Lys 
145 

Lys Val Gin Glu He 
165 

Ser Pro Tyr Gly Ala 
180 

Phe Arg Glu Ala Tyr 
195 

His Glu Ser Pro Arg 
210 



TION: SEQ ID NO : 3 : 

Ala Arg Cys Pro Ser Ala 
10 

Pro Arg Asn Val Ala Leu 
25 

Tyr Leu Ala Glu Phe Leu 
40 

Val Arg Arg Ser Ser Ser 
55 

Lys Asn Pro Gin Ala His 
70 75 

Asp Leu Thr Asp Ser Thr 
90 

Pro Thr Glu He Tyr Asn 
105 

Phe As? Leu Ala Glu Tyr 
120 

Arg Leu Leu Asp Ala Val 
135 

Phe Tyr Gin Ala Ser Thr 
150 1S5 

Pro Gin Lys Glu Thr Thr 
170 

Ala Lys Leu Tyr Ala Tyr 
185 

Asn Leu Phe Ala Val Asn 
200 

Arg Gly Ala Asn Phe Val 
215 



Arg Gly Ser Gly Asp 
15 

He Thr Gly He Thr 
30 

Leu Glu Lys Gly Tyr 
45 

Phe Asn Thr Gly Arg 
60 

He Glu Gly Asn Met 
80 

Cys Leu Val Lys He 
95 

Leu Gly Ala Gin Ser 
110 

Thr Ala Asp Val Asp 
125 

Lys Thr Cys Gly Leu 
140 

Ser Glu Leu Tyr Gly 
160 

Pro Phe Tyr Pro Arg 
175 

Trp He Val Val Asn 
190 

Gly He Leu Phe Asn 
205 

Thr Arg Lys He Ser 
220 
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Arg Ser Val Ala Lys 
225 

Gly Asn. Leu Asp Ala 
245 

Glu Ala Mec Trp Leu 
260 

lie Ala Thr Gly Glu 
275 

Phe Leu His lie Gly 
290 

Glu Val Gly Arg Cys 
305 

Leu Lys Tyr Tyr Arg 
325 

Thr Lys Ala Lys Gin 
340 

Glu Leu Val Arg Glu 
355 



He Tyr Leu Gly Gin Leu Glu Cys Phe Ser Leu 
230 235 240 

Lys Arg Asp Trp Gly His Ala Lys Asp Tyr Val 
250 255 

Mec Leu Gin Asn Asp Glu Pro Glu Asp Phe Val 
265 270 

Val His Ser Val Arg Glu Phe Val Glu Lys Ser 
230 285' 

Lys Thr He Val Trp Glu Gly Lys Asn Glu Asn 
295 300 

Lys Glu Thr Gly Lys Val His Val Thr Val Asp 
310 315 320 

Pro Thr Glu Val Asp Phe Leu Gin Gly Asp Cys 
330 335 

Lys Leu Asn Trp Lys Pro Arg Val Ala Phe Asp 
345 350 

Met Val His Ala Asp Val Glu Leu Met Arg Thr 
360 365 



Asn Pro Asn Ala 
370 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oligoucleotide 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 
TGATGAGCCA GAGGACTTTG TCATAGCTAC 30 



(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oligoucleotide 
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(xi) SEQUENC2 DESCRIPTION: SEQ ID NO 
CAGAAAGTCC ACTTCAGTCG GTCGGTAGTA 
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